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SEQUENCE LISTING 

<110> Diversa Corporation 
Morgan, Brian 
Burk, Mark 
Levin, Michael 
Zhu, Zoulin 
Chaplin, Jennifer 
Kustedjo, Karen 
Huang, Zilin 
Greenberg, William 

, <120> METHODS FOR MAKING SIMVASTATIN AND INTERMEDIATES 

<130> 564462012840 

<140> Not Yet Assigned. 
<141> 2004-10-20 

<150> US 60/542,100 
<151> 2004-02-04 

<150> US 60/513,237 
<151> 2003-10-21 

<160> 6 • 

<170> Patentin version 3.1 

<210> 1 

<211> 1629 

<212> DNA 

<213> Unknown 

<220> 

<Z23>' Environmental 
<400> 1 

atgagccttt gcgtcattcg attcatcgcc ggaactttgg tacttgtggc gtcagtggaa 
tcggcagttg ctcaacaagc gtgtgctgac ctgatgggcc tcgagctgcc gtatacaacg 120 
ataacgtccg ctgcagtggc taccgagggc ccaatcccac agccggcgat ctttggaagc 
actgacccca ttgtggctcc agagcgatgt gaagtgcggg cggtcacgcg ccctacgaag 
gactccgaga ttcgaatcga gctctggctg ccgctctccg gatggaacgg aaagtatcta 
caaattggta gcggtggctg ggctggttcg atcaatcgaa cggggctgat aggccctctt 
cagcgcggtt atgccgtagc cgcaaccgac aatggccata tcagcgaagg tttggtgcct 420 
gacgcctcct gggctatcgg ccatccgcaa aagctgatcg atttcggtta tcgcgccgtg 480 
cacgaaacaa gtgttcaggc caaagctatc ctgcgcgcct actttggccg cggtcaggat 540 
ctgagctact tcagcggttg ttctaatggc ggacgcgagg ctctcatgga ggcgcagcgc 600 
tatccggaag atttcgaagg catcatcgcc ggagcgcccg cgaacaattg gtcgcgcctg 660 
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60 



180 
240 
300 
360 
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tttacggggt ttgtgtggaa tgaacgcgcg ttggcggacg atccaattcc tcctgccaag 
ttgacagcga ttcaggcggc ggcaattgct gcgtgtgata cgctggacgg tgttgaggac 
gggctcatcg aaaacccacg agcgtgtagc ttcgatccgc gttcaatggt ctgtacagcc 
gatgatgcct ctgactgtct gacagaagga caggtcgcga cgctacacag gatatatagc 
ggcccaacca atcctcggac cggtgagcga atctttccag gctatccgat gggcaccgaa 
gccgtgccgg gcggatgggt accgtggatc gtgtccgcga gctccgaagt tccgagcata 
caagcaagct ttggcaactc ctattacggg cacgcggtct tcgagcaatc gaactgggat 
ttcaggacgt tggatttcga ccaggacgtt gcgtttggcg atgcgaaggc ggggccggtg 
ctcaatgcca cgaaccccga tctgcgttcg tttcgcgcga atggcggcaa actgattcag 
tatcatggct ggggcgatgc agccattacg gcttttagtt cgatcgacta ctacgagaac 
gtgcgcgcct tcctcgatcg cttccccgac ccccgaagcg agaacacgga tatcgacggt 
ttctatcgcc tgttcctggt tccgggcatg ggacattgct ccggcgggat cggcccaagt 
agctttggca atggcttccg ttccgcacgt acggatgccg agcacgacct actctccgcc 
cttgaggcat gggtggagcg agacacggcc ccggagagat tgatcggaac ggggacggcc 
gtaggcgacc caaccgcgac tctgacgcgt ccgctatgcc catatccgcg gacggcacgg 
tatctcggaa gcggcaactc aaatgatgcg gccaacttcg agtgcgccct gcccgctggc 
gtgcagtag 



720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1629 



<210> 2 
<211> 542 
<212> PRT 
<213> Unkovm 

<220> 

<223> Environmental 
<220> 

<221> SIGNAL 
<222> (1) . . . (24) 



Met Ser Leu Cys ' 



Ala ser Val Glu Ser Ala Val Ala Gin Gin Ala Cys Ala Asp Leu Met 



Gly Leu Glu Leu Pro Tyr Thr Thr He Thr Ser Ala Ala Val Ala Thr 
35 40 
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Glu Gly Pro He Pro Gin Pro Ala He 
50 55 



Phe Gly Ser Thr Asp Pro He 
60 

val Ala Pro Glu Arg Cys Glu Val Arg Ala Val Thr Arg Pro Thr Lys 
65 70 75 

ASP ser Glu He Arg He Glu Leu Trp Leu Pro Leu Ser Gly Trp Asn 
85 

Gly Lys Tyr Leu Gin He Gly Ser Gly Gly Trp Ala Gly Ser He Asn 
100 

Arg Thr Gly Leu He Gly Pro Leu Gin Arg Gly Tyr Ala Val Ala Ala 
H5 120 125 

Thr ASP Asn Gly His He Ser Glu Gly Leu Val Pro Asp Ala Ser Trp 
130 135 

Ala He Gly His Pro Gin Lys Leu He Asp Phe Gly Tyr Arg Ala Val 
145 150 155 

His Glu Thr ser Val Gin Ala Lys Ala He Leu Arg Ala Tyr Phe Gly 



Arg Gly Gin Asp Leu Ser Tyr Phe Ser Gly Cys Ser Asn Gly Gly Arg 
180 185 

Glu Ala Leu Met Glu Ala Gin Arg Tyr Pro Glu Asp Phe Glu Gly He 
195 200 205 

He Ala Gly Ala Pro Ala Asn Asn Trp Ser Arg Leu Phe Thr Gly Phe 
210 215 220 



val Trp Asn Glu Arg Ala Leu Ala Asp Asp Pro He Pro Pro Ala Lys 
225 230 235 

Leu Thr Ala He Gin Ala Ala Ala He Ala Ala Cys Asp Thr Leu Asp 
245 250 

Gly val Glu ASP Gly Leu He Glu Asn Pro Arg Ala Cys Ser Phe Asp 
260 , 265 

Pro Arg Ser Met Val Cys Thr Ala Asp Asp Ala Ser Asp Cys Leu Thr 
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Glu Gly Gin val Ala Thr Leu His Arg He Tyr Ser Gly Pro Thr Asn 
290 295 300 

Pro Arg Thr Gly Glu Arg He Phe Pro Gly Tyr Pro Met Gly Thr Glu 
305 310 315 320 

Ala Val Pro Gly Gly Trp Val Pro Trp He Val Ser Ala Ser Ser Glu 
325 330 335 

Val Pro Ser He Gin Ala Ser Phe Gly Asn Ser Tyr Tyr Gly His Ala 
340 345 350 

Val Phe Glu Gin Ser Asn Trp Asp Phe Arg Thr Leu Asp Phe Asp Gin 
355 360 365 

Asp Val Ala Phe Gly Asp Ala Lys Ala Gly Pro Val Leu Asn Ala Thr 
370 375 380 

Asn Pro Asp Leu Arg Ser Phe Arg Ala Asn Gly Gly Lys Leu He Gin 
385 390 395 400 

Tyr His Gly Trp Gly Asp Ala Ala He Thr Ala Phe Ser Ser He Asp 
405 410 415 



Tvr Tyr Glu Asn Val Arg Ala Phe Leu Asp Arg Phe Pro Asp Pro Arg 
420 425 430 



ser Glu Asn Thr Asp He Asp Gly Phe Tyr Arg Leu Phe Leu Val Pro 
435 440 445 



Gly Met Gly His Cys Ser Gly Gly He Gly Pro Ser Ser Phe Gly Asn 
450 455 460 



Gly Phe Arg Ser Ala Arg Thr Asp Ala Glu His Asp Leu Leu Ser Ala 
465 470 475 480 

Leu Glu Ala Trp Val Glu Arg Asp Thr Ala Pro Glu Arg Leu He Gly 



485 



490 



495 



Thr Gly Thr Ala Val Gly Asp Pro Thr Ala Thr Leu Thr Arg Pro Leu 
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CVS Pro Tyr Pro Arg Thr Ala Arg Tyr Leu Gly Ser Gly Asn Ser Asn 
520 525 



515 



, Ala Ala Asn Phe Glu Cys Ala Leu Pro Ala Gly Val Gin 
530 535 540 



<210> 3 
<211> 1209 
<212> DNA 
<213> Unknown 



<220> 

<223> Environmental 



aJggaaatcc atggtacatg cgacccaaag tttcacttgg tgcggcagga gtttgaacga 60 
aatttgcgtg agcgcggcga agtaggagcg tccgtttgcg tcacgttgca cggcgaaacc 120 
gtagtggact tgtggggcgg catggcgcgt gccgacactc agacgccatg gacggcggag 
acggtcagta ttgttttttc ctccaccaaa ggcgcaacgg cactctgcgc ccatatgctg 
gcgtcacgcg gccaactgga tcttgatgca ccagtcgcca cctactggcc ggaatttgcc 
caagccggca aagctcgcat cccggtgaaa atgctcttga accatcaagc tggtctccct 
gccgtacgga caccgctgcc ccagggtgcc tacgctgact gggaactgat ggtcaatabg 
ttggccaagg aagagccgtt ttgggaacct ggcacccgca acggctatca tgcgctcacc 
atggggtggc tggtgggaga agtggtgcga cgtgtctctg gtaagtcgct tgggacattc 
ttccaagagg agatcgccag gccgttgggg ttagatttct ggattggctt accagcagag 
caagaggcac gggtcgcgcc gatgatcgcg gcggagcctg atccgcaaag cctcttcttc 
caagaggtcg cgaagcctgg ggccttacag tcgctcgtac tccttaactc cggcggctat 
atgggtgctc agcctgagta tgactcgcgg gcggcgcatg cggccgagat tggtgcagcc 
ggtggtatca ccaacgcacg cggcctggca ggcatgtacg caccactggc ctgcggaggc 840 
aaactcaaag gggtggagtt ggtcagtcct gacatgctgg cccgaatgtc cagagtggcc 900 
tctgcgactg ggagagatgc cgtgctcatg atgccaaccc ggtttgccct gggcttcatg 960 
aagtccatgg acaaccgccg ggagcctgct ggcgtgcagg acagcgcgct ctttggggag 1020 
gaggcttttg gccatgtggg ggccgggggt tcgtttggtt ttgccgatcc caaagcagga 1080 
atgtcctttg gctataccat gaaccgaatg gggctgggag ccgggctcaa cccgcggggg 1140 
caaagcctgg tggatgcaac ctaccgctcg ttagggtatc agtcggatgc ctctggagcc " 1200 

1209 

tggacctga 



180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
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<210> 4 
<211> 402 
<212> PRT 
<213> Unkown 

<220> 

<223> Environmental 
<220> 

<221> DOMAIN 
<222> (18) . . . (386) 
<223> Beta-lactamase 

Met°Glu lie His Gly Thr Cys Asp Pro Lys Phe His Leu Val Arg Gin 



, Asn Leu Arg Glu Arg Gly Glu Val Gly Ala Ser Val 
25 30 

cys val Thr Leu His Gly Glu Thr Val Val Asp Leu Trp Gly Gly Met 
35 40 

Ala Arg Ala Asp Thr Gin Thr Pro Trp Thr Ala Glu Thr Val Ser He 
50 55 

val Phe ser Ser Thr Lys Gly Ala Thr Ala Leu Cys Ala His Met Leu 
65 70 75 

Ala ser Arg Gly Gin Leu Asp Leu Asp Ala Pro Val Ala Thr Tyr Trp 
85 30 

pro Glu Phe Ala Gin Ala Gly Lys Ala Arg He Pro Val Lys Met Leu 
100 105 

Leu Asn His Gin Ala Gly Leu Pro Ala Val Arg Thr Pro Leu Pro Gin 
115 120 125 

Gly Ala Tyr Ala Asp Trp Glu Leu Met Val Asn Thr Leu Ala Lys Glu 
130 135 

Glu pro Phe Trp Glu Pro Gly Thr Arg Asn Gly Tyr His Ala Leu Thr 
145 150 155 

Met Gly Trp Leu Val Gly Glu Val Val Arg Arg Val Ser Gly Lys Ser 
165 I'O 
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Leu Gly Thr Phe Phe Gin Glu Glu He Ala Arg Pro Leu Gly Leu Asp 
180 185 190 

Phe Trp He Gly Leu Pro Ala Glu Gin Glu Ala Arg Val Ala Pro Met 
195 200 205 

He Ala Ala Glu Pro Asp Pro Gin Ser Leu Phe Phe Gin Glu Val Ala 
210 215 220 

Lvs Pro Gly Ala Leu Gin Ser Leu Val Leu Leu Asn Ser Gly Gly Tyr 
225 230 235 240 

Met Gly Ala Gin Pro Glu Tyr Asp Ser Arg Ala Ala His Ala Ala Glu 
245 250 255 

He Gly Ala Ala Gly Gly He Thr Asn Ala Arg Gly Leu Ala Gly Met 
260 265 270 

Tyr Ala Pro Leu Ala Cys Gly Gly Lys Leu Lys Gly Val Glu Leia Val 
275 280 285 

Ser Pro Asp Met Leu Ala Arg Met Ser Arg Val Ala Ser Ala Thr Gly 
290 295 300 

Arg Asp Ala Val Leu Met Met Pro Thr Arg Phe Ala Leu Gly Phe Met 
305 310 315 320 

Lys Ser Met Asp Asn Arg Arg Glu Pro Ala Gly Val Gin Asp Ser Ala 
325 330 335 

Leu Phe Gly Glu Glu Ala Phe Gly His Val Gly Ala Gly Gly Ser Phe 
340 345 350 

Gly Phe Ala Asp Pro Lys Ala Gly Met Ser Phe Gly Tyr Thr Met Asn 
355 360 365 

Arg Met Gly Leu Gly Ala Gly Leu Asn Pro Arg Gly Gin Ser Leva Val 
370 375 380 

Asp Ala Thr Tyr Arg Ser Leu Gly Tyr Gin Ser Asp Ala Ser Gly Ala 
385 390 395 400 



Trp Thr 
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<210> 5 
<211> 1578 
<212> DNA 
<213> Unknown 

<220> 

<223> Environmental 

atgagatcag cagctcgcat cagcgtggcg gcagttgcct ttctttgcct gctcttgacg 
actcgggttt ccgcccagat cgtgccggcg atggaatgtg cggatctggc gaatcagcag 
cttcccaaca cgacgatcac ctcggcccag accgtcacca ccggatcgtt aacgcccccg 
ggctcgacga atccgatcac cgacctgcct cctttctgcc gtgtcacagg cgccatcgcc 
ccgacgagcg agtcgcacat cctcttcgag gtctggctgc cgctggataa atggaacggc 
aagttcgccg gcgtgggcaa cggcggctgg gccggcatca tctccttcgg cgccctcgga 
agccagctca agcgcggcta cgcgaccgcc tccacgaata cgggtcacga agcggcgccg 
gggatgaacg cagccaggtt tgcgttcgag aagccggagc agcttatcga cttcgcctat 
cgctcccagc acgagacggc cctgaaagcg aaggcgctgg ttcaggcttt ctacgggaag 
ccgccggaac actcctattt catcgggtgc tcatcgggtg ggtaccaggg cctgatggag 
gcccaacgat ttccggccga ctacgacggg atcgtcgccg gtatgccggc gaacaactgg 
acacggctga tggccggcga cttggacgcg atccttgccg tctccgtaga tcctgccagc 
caccttcccg tctccgcatt gggtctgttg tatcgctcgg tgctcgctgc ctgcgacggc 
atcgacggtg ttgtagacgg tgttctggag gatccgcgcc gatgccggtt cgacccggcc 
gtgttgatgt gcaaggcgga tcagaatccc gatggctgcc ttacgccggc tcaggtggaa 
gcggcacggc gcatatacgg cggtctgaag gatcccaaga ccggcgctca gctctatccg 
gggctggcgc cgggaagcga gccgttctgg ccgcaccgca atccggcgaa tccgttccct 
attccgatcg cgcactacaa gtggctcgtc tttgccgatc caaactggga ttggagaaca 
ttcaagttca cggatccggc ggactaccag gctttcctca atgcggaagc cacgtatgcc 
cctactctca atgcgaccaa tccggacctc cgggagttca gccggcgcgg cggcaggttg 
attcagtacc atggctggaa cgatcagctg attgccccgc aaaacagcat cgactattac 
gagagcgtcc tttcgttctt cgggtccggc aaacaggatc gagcgcagac cgtgcgcgag 
gttcagagct tctaccggct gttcatggcg ccgggtatgg ctcactgtgg aggcggtaca 
ggtccgaact catttgacat gctggatgcc ctcgagaagt gggtggaagg cgggatagcg 
ccggaacgag tccttgcgac gcgttccata aacggcgtag tcgaccggct gcgcccgctc 

8 
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120 
180 
240 
300 
360 
420 
480 
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600 
660 
720 
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960 
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1140 
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tgtccatatc cgcaggtcgc cgtgtacaag ggtcatgggg atacaaacga cgccgcgaac 1560 
ttcgtctgtc gcgattag 1578 

<210> 6 
<211> 525 
<212> PRT 
<213> Unkown 

<220> 

<223> Environmental 
<220> 

<221> SIGNAL 
<222> (1) . . . (25) 

<400> 6 

Met Arg Ser Ala Ala Arg He Ser Val Ala Ala Val Ala Phe Leu Cys 
15 10 15 

Leu Leu Leu Thr Thr Arg Val Ser Ala Gin He Val Pro Ala Met Glu 
20 25 30 

Cys Ala Asp Leu Ala Asn Gin Gin Leu Pro Asn Thr Thr He Thr Ser 
35 40 45 

Ala Gin Thr Val Thr Thr Gly Ser Leu Thr Pro Pro Gly Ser Thr Asn 



Pro He Thr Asp Leu Pro Pro Phe Cys Arg Val Thr Gly Ala He Ala 
65 70 75 80 



Pro Thr Ser Glu Ser His He Leu Phe Glu Val Trp Leu Pro Leu Asp 

85 90 95 



Lys Trp Asn Gly Lys Phe Ala Gly Val Gly Asn Gly Gly Trp Ala Gly 
100 105 110 



He He Ser Phe Gly Ala Leu Gly Ser Gin Leu Lys Arg Gly Tyr Ala 
115 120 125 



Thr Ala Ser Thr Asn Thr Gly His Glu Ala Ala Pro Gly Met Asn Ala 
130 135 140 



Ala Arg Phe Ala Phe Glu Lys Pro Glu Gin Leu He Asp Phe Ala Tyr 
145 150 155 160 
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Arq Ser Gin His Glu Thr Ala Leu Lys Ala Lys Ala Leu Val Gin Ala 
• 165 170 175 

Phe Tyr Gly Lys Pro Pro Glia His Ser Tyr Phe He Gly Cys Ser Ser 
180 185 190 

Glv Gly Tyr Gin Gly Leu Met Glu Ala Gin Arg Phe Pro Ala Asp Tyr 
195 200 205 

AsPvGly He Val Ala Gly Met Pro Ala Asn Asn Trp Thr Arg Leu Met 
^210 215 220 

Ala Gly Asp Leu Asp Ala He Leu Ala Val Ser Val Asp Pro Ala Ser 
225 230 235 240 

His Leu Pro Val Ser Ala Leu Gly Leu Leu Tyr Arg Ser Val Leu Ala 
245 250 255 

Ala Cys Asp Gly He Asp Gly Val Val Asp Gly Val Leu Glu Asp Pro 
260 265 270 

Arg Arg Cys Arg Phe Asp Pro Ala Val Leu Met Cys Lys Ala Asp Gin 
275 280 285 

Asn Pro Asp Gly Cys Leu Thr Pro Ala Gin Val Glu Ala Ala Arg Arg 
290 295 300 

He Tyr Gly Gly Leu Lys Asp Pro Lys Thr Gly Ala Gin Leu Tyr Pro 
305 310 315 320 

Gly Leu Ala Pro Gly Ser Glu Pro Phe Trp Pro His Arg Asn Pro Ala 
325 330 335 

Asn Pro Phe Pro He Pro He Ala His Tyr Lys Trp Leu Val Phe Ala 
340 345 350 

Asp Pro Asn Trp Asp Trp Arg Thr Phe Lys Phe Thr Asp Pro Ala Asp 
355 360 365 

Tyr Gin Ala Phe Leu Asn Ala Glu Ala Thr Tyr Ala Pro Thr Leu Asn 
370 375 380 

Ala Thr Asn Pro Asp Leu Arg Glu Phe Ser Arg Arg Gly Gly Arg Leu 
385 390 395 400 
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lie Gin Tyr His Gly Trp Asn Asp Gin Leu lie Ala Pro Gin Asn Ser 
405 410 415 

He Asp Tyr Tyr Glu Ser Val Leu Ser Phe Phe Gly Ser Gly Lys Gin 
420 425 430 

Asp Arg Ala Gin Thr Val Arg Glu Val Gin Ser Phe Tyr Arg Leu Phe 
435 440 445 

Met Ala Pro Gly Met Ala His Cys Gly Gly Gly Thr Gly Pro Asn Ser 
450 455 460 

Phe Asp Met Leu Asp Ala Leu Glu Lys Trp Val Glu Gly Gly He Ala 
465 470 475 480 



Pro Glu Arg Val Leu Ala Thr Arg Ser He Asn Gly Val Val Asp Arg 
485 490 495 

Leu Arg Pro Leu Cys Pro Tyr Pro Gin Val Ala Val Tyr Lys Gly His 



Gly Asp Thr Asn Asp Ala Ala Asn Phe Val Cys Arg Asp 
515 520 525 
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